FIELD OF TECHNOLOGY

The present invention relates to a novel DNA and a process 
for preparing a protein which possesses an activity to inhibit 
osteoclast differentiation and/or maturation (hereinafter 
called osteoclastogenesis-inhibitory activity) by a genetic 
engineering technique using the DNA. More particularly, the 
present invention relates to a genomic DNA encoding a protein 
OCIF which possesses an osteoclastogenesis-inhibitory activity 
and a process for preparing said protein by a genetic engineering 
technique using the genomic DNA. 

BACKGROUND OF THE INVENTION 
Human bones are constantly repeating a process of 
resorption and formation. Osteoblasts controlling formation 
of bones and osteoclasts controlling resorption of bones take 
major roles in this process. Osteoporosis is a typical disease 
caused by abnormal metabolism of bones. This disease is caused 
when bone resorption by osteoclasts exceeds bone formation by 
osteoblasts. Although the mechanism of this disease is still 
to be elucidated completely, the disease causes the bones to 
ache, makes the bones fragile, and may result in fracturing
of the bones. As the population of the aged increases, this
disease results in an increase in bedridden aged people, which
has become a social problem. Urgent development of a therapeutic
agent for this disease is strongly desired. Disease due to a
decrease in bone mass is expected to be treated by controlling
bone resorption, accelerating bone formation, or improving
the balance between bone resorption and formation.

Osteogenesis is expected to increase by accelerating 
proliferation, differentiation, or activation of the cells 
controlling bone formation, or by controlling proliferation,
differentiation, or activation of the cells involved in bone
resorption. In recent years, strong interest has been directed
to physiologically active proteins (cytokines) exhibiting such
activities as described above, and energetic research is
ongoing on this subject. The cytokines which have been reported
to accelerate proliferation or differentiation of osteoblasts
include the proteins of the fibroblast growth factor family (FGF:
Rodan S. B. et al., Endocrinology vol. 121, p 1917, 1987),
insulin-like growth factor I (IGF-I: Hock J. M. et al.,
Endocrinology vol. 122, p 254, 1988), insulin-like growth factor II
(IGF-II: McCarthy T. et al., Endocrinology vol. 124, p 301, 1989),
Activin A (Centrella M. et al., Mol. Cell. Biol., vol. 11, p
250, 1991), transforming growth factor-β (Noda M., The Bone,
vol. 2, p 29, 1988), Vasculotropin (Varonique M. et al., Biochem.
Biophys. Res. Commun., vol. 199, p 380, 1994), and the proteins
of the heterotopic bone formation factor family (bone morphogenic
protein; BMP: BMP-2; Yamaguchi A. et al., J. Cell Biol. vol.
113, p 682, 1991, OP-1; Sampath T. K. et al., J. Biol. Chem.
vol. 267, p 20532, 1992, and Knutsen R. et al., Biochem. Biophys.
Res. Commun. vol. 194, p 1352, 1993).

On the other hand, as the cytokines which suppress
differentiation and/or maturation of osteoclasts, transforming
growth factor-β (Chenu C. et al., Proc. Natl. Acad. Sci. USA,
vol. 85, p 5683, 1988), interleukin-4 (Kasano K. et al.,
Bone-Miner., vol. 21, p 179, 1993), and the like have been
reported. Further, as the cytokines which suppress bone
resorption by osteoclasts, calcitonin (Bone-Miner., vol. 17, p
347, 1992), macrophage colony stimulating factor (Hattersley
G. et al., J. Cell. Physiol., vol. 137, p 199, 1988),
interleukin-4 (Watanabe K. et al., Biochem. Biophys. Res.
Commun. vol. 172, p 1035, 1990), and interferon-γ (Gowen M. et
al., J. Bone Miner. Res., vol. 1, p 469, 1986) have been
reported.

These cytokines are expected to be used as agents for 
treating diseases accompanying bone loss by accelerating bone 
formation or suppressing bone resorption. Clinical tests
are being undertaken to verify the effect of some cytokines,
such as insulin-like growth factor-I and the heterotopic bone
formation factor family, in improving bone metabolism. In
addition, calcitonin is already commercially available as a
therapeutic agent for osteoporosis and a pain relief agent. At
present, drugs for clinically treating bone diseases or
shortening the period of treatment of bone diseases include
activated vitamin D3, calcitonin and its derivatives, and
hormone preparations such as estradiol agents, ipriflavone, or
calcium preparations. These agents are not necessarily
satisfactory in terms of efficacy and therapeutic results.
Development of a novel therapeutic agent which can be used in
place of these agents is strongly desired.

In view of this situation, the present inventors have 
undertaken extensive studies. As a result, the present
inventors found a protein, OCIF, exhibiting
osteoclastogenesis-inhibitory activity in a culture broth of
human embryonic lung fibroblast IMR-90 (ATCC Deposition No.
CCL186), and filed a patent application (PCT/JP96/00374). The
present inventors have conducted further studies relating to
the origin of this protein OCIF exhibiting the
osteoclastogenesis-inhibitory activity. These studies have
led to determination of the sequence of a genomic DNA
encoding OCIF of human origin. Accordingly, an object of the
present invention is to provide a genomic DNA encoding protein 
OCIF exhibiting osteoclastogenesis-inhibitory activity and a 
process for preparing this protein by a genetic engineering 
technique using the genomic DNA. 

DISCLOSURE OF THE INVENTION 


Specifically, the present invention relates to a genomic
DNA encoding protein OCIF exhibiting osteoclastogenesis-
inhibitory activity and a process for preparing this protein
by a genetic engineering technique using the genomic DNA. The
DNA of the present invention includes the nucleotide sequences
of Sequences No. 1 and No. 2 in the Sequence Table.

Moreover, the present invention relates to a process for 
preparing a protein, comprising inserting a DNA including the 
nucleotide sequences of Sequences No. 1 and No. 2 in the
Sequence Table into an expression vector, producing a vector
capable of expressing a protein having the following
physicochemical characteristics and exhibiting the activity of 
inhibiting differentiation and/or maturation of osteoclasts, 
and producing this protein by a genetic engineering technique, 
(a) molecular weight (SDS-PAGE):

(i) under reducing conditions: about 60 kD,

(ii) under non-reducing conditions: about 60 kD and about
120 kD;


(b) amino acid sequence:

includes an amino acid sequence of Sequence No. 3 of the
Sequence Table,

(c) affinity:

exhibits affinity to a cation exchanger and heparin, and

(d) thermal stability:

(i) the osteoclast differentiation and/or maturation
inhibitory activity is reduced when treated with heat
at 70°C for 10 minutes or at 56°C for 30 minutes,

(ii) the osteoclast differentiation and/or maturation
inhibitory activity is lost when treated with heat
at 90°C for 10 minutes.


The protein obtained by expressing the gene of the present 
invention exhibits an osteoclastogenesis-inhibitory activity.
This protein is effective as an agent for the treatment and
improvement of diseases involving a decrease in the amount of bone
such as osteoporosis, diseases relating to bone metabolism
abnormality such as rheumatism, degenerative joint disease, or
multiple myeloma, and is useful as an antigen to establish an
immunological diagnosis of such diseases.

BRIEF DESCRIPTION OF THE DRAWINGS
Figure 1 shows the result of Western blotting analysis of
the protein obtained by expressing the genomic DNA of the present
invention in Example 4 (iii), wherein lane
1 indicates a marker, lane 2 indicates the culture broth of COS7
cells into which the vector pWESRαOCIF (Example 4 (iii)) has been
transfected, and lane 3 indicates the culture broth of COS7 cells
into which the vector pWESRα (control) has been transfected.

BEST MODE FOR CARRYING OUT THE INVENTION
The genomic DNA encoding the protein OCIF which exhibits 
osteoclastogenesis-inhibitory activity in the present 
invention can be obtained by preparing a cosmid library using 
a human placenta genomic DNA and a cosmid vector and by screening 
this library using DNA fragments which are prepared based on 
the OCIF cDNA as a probe. The thus-obtained genomic DNA is 
inserted into a suitable expression vector to prepare an OCIF 
expression cosmid. A recombinant type OCIF can be obtained by 
transfecting the genomic DNA into a host organism such as various 
types of cells or microorganism strains and causing the DNA to 
express a protein by a conventional method. The resultant
protein exhibiting osteoclastogenesis-inhibitory activity (an
osteoclastogenesis-inhibitory factor) is useful as an agent for
the treatment and improvement of diseases involving a decrease
in bone mass such as osteoporosis and other diseases relating
to bone metabolism abnormality and also as an antigen to prepare
antibodies for establishing immunological diagnosis of such
diseases. The protein of the present invention can be prepared
as a drug composition for oral or non-oral administration.
Specifically, the drug composition of the present invention 
containing the protein which is an osteoclastogenesis- 
inhibitory factor as an active ingredient can be safely 
administered to humans and animals. As the form of drug 
composition, a composition for injection, composition for 
intravenous drip, suppository, nasal agent, sublingual agent, 
percutaneous absorption agent, and the like are given. In the 
case of the composition for injection, such a composition is 
a mixture of a pharmacologically effective amount of 
osteoclastogenesis-inhibitory factor of the present invention 
and a pharmaceutically acceptable carrier. The composition may
further comprise amino acids, saccharides, cellulose
derivatives, and other excipients and/or activation agents,
including other organic compounds and inorganic compounds which
are commonly added to a composition for injection. When an
injection preparation is prepared using the
osteoclastogenesis-inhibitory factor of the present invention
and these excipients and activation agents, a pH adjuster,
buffering agent, stabilizer, solubilizing agent, and the like
may be added if necessary to prepare various types of injection
agents.

The present invention will now be described in more detail 
by way of examples which are given for the purpose of 
illustration and not intended to be limiting of the present 
invention. 



Example 1 

< Preparation of a cosmid library > 

A cosmid library was prepared using human placenta
genomic DNA (Clontech; Cat. No. 6550-2) and the pWE15 cosmid vector
(Stratagene). The experiment principally followed
the protocol attached to the pWE15 cosmid vector
kit of Stratagene Company; Molecular Cloning: A
Laboratory Manual (Cold Spring Harbor Laboratory (1989)) was
referred to for common procedures for handling DNA, E. coli,
and phage.

(i) Preparation of restriction enzyme digest of human genomic DNA

Human placenta genomic DNA dissolved in 750 µl of a
solution containing 10 mM Tris-HCl, 10 mM MgCl2, and 100 mM NaCl
was added to four 1.5 ml Eppendorf tubes (tubes A, B, C, and D)
in the amount of 100 µg each. Restriction enzyme MboI was added
to these tubes in the amounts of 0.2 unit for tube A, 0.4 unit
for tube B, 0.6 unit for tube C, and 0.8 unit for tube D, and
DNA was digested for 1 hour. Then, EDTA in the amount to make
a 20 mM concentration was added to each tube to terminate the
reaction, followed by extraction with phenol/chloroform (1:1).
A two-fold amount of ethanol was added to the aqueous layer to
precipitate DNA. DNA was collected by centrifugation, washed
with 70% ethanol, and DNA in each tube was dissolved in 100 µl
of TE (10 mM Tris-HCl (pH 8.0) + 1 mM EDTA buffer solution, hereinafter
called TE). DNA in the four tubes was combined in one tube and
incubated for 10 minutes at 68°C. After cooling to room
temperature, the mixture was overlaid onto a 10%-40% linear
sucrose gradient which was prepared in a buffer containing 20
mM Tris-HCl (pH 8.0), 5 mM EDTA, and 1 mM NaCl in a centrifuge
tube (38 ml). The tube was centrifuged at 26,000 rpm for 24
hours at 20°C using a rotor SRP28SA manufactured by Hitachi,
Ltd., and 0.4 ml fractions of the sucrose gradient were collected
using a fraction collector. A portion of each fraction was
subjected to 0.4% agarose electrophoresis to confirm the size
of DNA. Fractions containing DNA with a length of 30 kb (kilobase
pairs) to 40 kb were thus combined. The DNA solution was
diluted with TE to bring the sucrose concentration to 10% or less
and a 2.5-fold volume of ethanol was added to precipitate DNA.
DNA was dissolved in TE and stored at 4°C.


The pWE15 cosmid vector obtained from Stratagene Company 
was completely digested with restriction enzyme BamHI according 
to the protocol attached to the cosmid vector kit. DNA 
collected by ethanol precipitation was dissolved in TE to a 
concentration of 1 mg/ml. The phosphate group at the 5'-end of this
DNA was removed using calf small intestine alkaline phosphatase,
and DNA was collected by phenol extraction and ethanol
precipitation. The DNA was dissolved in TE to a concentration
of 1 mg/ml.

(iii) Ligation of genomic DNA to vector and in vitro packaging

1.5 µg of genomic DNA fractionated according to
size and 3 µg of pWE15 cosmid vector which was digested with
restriction enzyme BamHI were ligated in 20 µl of a reaction
solution using Ready-To-Go T4 DNA ligase of Pharmacia Company.
The ligated DNA was packaged in vitro using Gigapack™ II
packaging extract (Stratagene) according to the protocol.
After the packaging reaction, a portion of the reaction mixture
was diluted stepwise with an SM buffer solution and mixed with
E. coli XL1-Blue MR (Stratagene) which was suspended in 10 mM
MgCl2 to allow the phage to infect the cells, and plated onto LB
agar plates containing 50 µg/ml of ampicillin. The number of colonies
produced was counted. The number of colonies per 1 µl of
packaging reaction was calculated based on this result.
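
As an illustration only, the titer calculation mentioned above can be sketched in Python. The plate count, dilution factor, and plated volume used below are hypothetical values, not data from this experiment, and the function names `packaging_titer` and `volume_for_target_colonies` are assumptions introduced for this sketch.

```python
def packaging_titer(colony_count, dilution_factor, plated_volume):
    """Estimate colonies per unit volume of undiluted packaging reaction.

    colony_count: colonies counted on one plate
    dilution_factor: total fold dilution of the plated sample (100 for 1/100)
    plated_volume: volume of diluted sample plated (the result is expressed
        per the same volume unit)
    """
    if dilution_factor <= 0 or plated_volume <= 0:
        raise ValueError("dilution factor and plated volume must be positive")
    return colony_count * dilution_factor / plated_volume


def volume_for_target_colonies(target_colonies, titer):
    """Volume of undiluted packaging reaction expected to give target_colonies."""
    if titer <= 0:
        raise ValueError("titer must be positive")
    return target_colonies / titer


# Hypothetical example: 150 colonies from 10 volume units of a 1/100 dilution.
titer = packaging_titer(150, 100, 10)
print(titer)  # 1500.0
```

A titer obtained in this way could then be passed to `volume_for_target_colonies` to estimate how much packaging reaction to plate for a desired colony density.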

(iv) Preparation of a cosmid library

The packaging reaction solution thus prepared was mixed
with E. coli XL1-Blue MR and the mixture was plated onto agarose
plates containing ampicillin so as to produce 50,000 colonies
per agarose plate with a diameter of 15 cm. After incubating
the plates overnight at 31°C, LB culture medium was added in
the amount of 3 ml per plate to suspend and collect the colonies
of E. coli. Each agarose plate was again washed with 3 ml of
the LB culture medium and the washing was combined with the
original suspension of E. coli. The E. coli collected from all
agarose plates was placed in a centrifuge tube, glycerol was
added to a concentration of 20%, and ampicillin was further added
to make a final concentration of 50 µg/ml. A portion of the
E. coli suspension was removed and the remainder was stored at
-80°C. The removed E. coli was diluted stepwise and plated onto
agar plates to count the number of colonies per 1 ml of
suspension.

Example 2 

< Screening of cosmid library and purification of colony > 

A nitrocellulose filter (Millipore) with a diameter of
14.2 cm was placed on each LB agarose plate with a diameter of
15 cm which contained 50 µg/ml of ampicillin. The cosmid library
was plated onto the plates so as to produce 50,000 colonies of
E. coli per plate, followed by incubation overnight at 37°C.
E. coli on the nitrocellulose filter was transferred to another
nitrocellulose filter according to a conventional method to
obtain two replica filters. According to the protocol attached
to the cosmid vector kit, cosmid DNA in the E. coli on the replica
filters was denatured with an alkali, neutralized, and
immobilized on the nitrocellulose filter using a Stratalinker
(Stratagene). The filters were heated for two hours at 80°C
in a vacuum oven. The nitrocellulose filters thus obtained were
hybridized using two kinds of DNA produced, respectively, from
the 5'-end and 3'-end of human OCIF cDNA as probes. Namely, a
plasmid was purified from E. coli pKB/OIF10 (deposited at The
Ministry of International Trade and Industry, the Agency of
Industrial Science and Technology, Biotechnology Laboratory,
Deposition No. FERM BP-5267) containing OCIF cDNA. The plasmid
containing OCIF cDNA was digested with restriction enzymes KpnI
and EcoRI. The fragments thus obtained were separated using agarose
gel electrophoresis. A KpnI/EcoRI fragment with a length of 0.2
kb was purified using a QIAEX II gel extraction kit (Qiagen).
This DNA was labeled with 32P using the Megaprime DNA Labeling
System (Amersham) (5'-DNA probe). Apart from this, a
BamHI/EcoRV fragment with a length of 0.2 kb which was produced
from the above plasmid by digestion with restriction enzymes
BamHI and EcoRV was purified and labeled with 32P (3'-DNA probe).
One of the replica filters described above was hybridized with
the 5'-DNA probe and the other with the 3'-DNA probe.
Hybridization and washing of the filters were carried out
according to the protocol attached to the cosmid vector kit.
Autoradiography detected several positive signals with each
probe. One colony which gave positive signals with both probes
was identified. The colony on the agar plate which
corresponded to the signal on the autoradiogram was isolated
and purified. A cosmid was prepared from the purified colony
by a conventional method. This cosmid was named pWEOCIF. The
size of human genomic DNA contained in this cosmid was about
38 kb.

Example 3 

< Determination of the nucleotide sequence of human OCIF genomic DNA >

(i) Subcloning of OCIF genomic DNA

Cosmid pWEOCIF was digested with restriction enzyme EcoRI.
After the separation of the DNA fragments thus produced by
electrophoresis using a 0.7% agarose gel, the DNA fragments were
transferred to a nylon membrane (Hybond-N, Amersham) by the
Southern blot technique and immobilized on the nylon membrane
using Stratalinker (Stratagene). On the other hand, plasmid
pBKOCIF was digested with restriction enzyme EcoRI and a 1.6
kb fragment containing human OCIF cDNA was isolated by agarose
gel electrophoresis. The fragment was labeled with 32P using
the Megaprime DNA labeling system (Amersham).

Hybridization of the nylon membrane described above with
the 32P-labeled 1.6-kb OCIF cDNA, performed according to a
conventional method, detected DNA fragments with sizes of
6 kb, 4 kb, 3.6 kb, and 2.6 kb. These fragments, which hybridized
with the human OCIF cDNA, were isolated using agarose gel
electrophoresis and individually subcloned into the EcoRI site
of the pBluescript II SK+ vector (Stratagene) by a conventional
method. The resulting plasmids were respectively named pBSE
6, pBSE 4, pBSE 3.6, and pBSE 2.6.

(ii) Determination of the nucleotide sequence

The nucleotide sequence of human OCIF genomic DNA which
was subcloned into the plasmids was determined using the ABI
Dideoxy Terminator Cycle Sequencing Ready Reaction kit (Perkin
Elmer) and the 373 Sequencing System (Applied Biosystems). The
primers used for the determination of the nucleotide sequence
were synthesized based on the nucleotide sequence of human OCIF
DNA in the Sequence Table. The nucleotide
sequences thus determined are given as Sequences No. 1 and
No. 2 in the Sequence Table. Sequence No. 1 includes
the first exon of the OCIF gene and Sequence No. 2 includes
the second, third, fourth, and fifth exons. A stretch of about
17 kb is present between the first and second exons.

Example 4

< Production of recombinant OCIF using COS-7 cells >

(i) Preparation of OCIF genomic DNA expression cosmid

To express OCIF genomic DNA in animal cells, an expression
unit of expression plasmid pcDL-SRα296 (Molecular and Cellular
Biology, vol. 8, p 466-472, 1988) was inserted into cosmid vector
pWE15 (Stratagene). First of all, the expression plasmid
pcDL-SRα296 was digested with restriction enzyme SalI to
cut out an expression unit with a length of about 1.7 kb which
includes an SRα promoter, SV40 late splice signal, poly(A)
addition signal, and so on. The digestion products were
separated by agarose electrophoresis and the 1.7-kb fragment
was purified using the QIAEX II gel extraction kit (Qiagen).
On the other hand, cosmid vector pWE15 was digested with
restriction enzyme EcoRI and the fragments were separated using
agarose gel electrophoresis. pWE15 DNA of 8.2 kb in length was
purified using the QIAEX II gel extraction kit (Qiagen). The
ends of these two DNA fragments were blunted using a DNA
blunting kit (Takara Shuzo), ligated using a DNA ligation kit
(Takara Shuzo), and introduced into E. coli DH5α (Gibco BRL).
The resultant transformant was grown and the expression cosmid
pWESRα containing an expression unit was purified using a
Qiagen column (Qiagen).

The cosmid pWEOCIF containing the OCIF genomic DNA with
a length of about 38 kb obtained in (i) above was digested with
restriction enzyme NotI to cut out the OCIF genomic DNA of
about 38 kb. After separation by agarose gel electrophoresis,
the DNA was purified using the QIAEX II gel extraction kit
(Qiagen). On the other hand, the expression cosmid pWESRα was
digested with restriction enzyme EcoRI and the digestion
product was extracted with phenol and chloroform, ethanol-
precipitated, and dissolved in TE.

pWESRα digested with restriction enzyme EcoRI and an
EcoRI-XmnI-NotI adapter (#1105, #1156, New England
Biolabs) were ligated using T4 DNA ligase (Takara
Shuzo Co., Ltd.). After removal of the free adapter by agarose
gel electrophoresis, the product was purified using the QIAEX gel
extraction kit (Qiagen). The OCIF genomic DNA with a length
of about 37 kb which was derived from the digestion with
restriction enzyme NotI and the pWESRα to which the adapter was
attached were ligated using T4 DNA ligase (Takara Shuzo). The
DNA was packaged in vitro using the Gigapack packaging extract
(Stratagene) and used to infect E. coli XL1-Blue MR (Stratagene).
The resultant transformant was grown and the expression cosmid
pWESRαOCIF, into which the OCIF genomic DNA was inserted, was
purified using a Qiagen column (Qiagen). The OCIF expression
cosmid pWESRαOCIF was ethanol-precipitated, dissolved in
distilled water, and used in the following analysis.

(ii) Transient expression of OCIF genomic DNA and measurement
of OCIF activity

A recombinant OCIF was expressed as described below
using the OCIF expression cosmid pWESRαOCIF obtained in (i)
above and its activity was measured. COS-7 cells (8 x 10^5
cells/well) (Riken Cell Bank, RCB0539) were seeded in a 6-well plate
using DMEM culture medium (Gibco BRL) containing 10% fetal
bovine serum (Gibco BRL). On the following day, the culture
medium was removed and the cells were washed with serum-free DMEM
culture medium. The OCIF expression cosmid pWESRαOCIF which
had been diluted with OPTI-MEM culture medium (Gibco BRL) was
mixed with Lipofectamine and the mixture was added to the cells
in each well according to the attached protocol. The expression
cosmid pWESRα was added to the cells in the same manner as a
control. The amounts of the cosmid DNA and Lipofectamine were
3 µg and 12 µl, respectively. After 24 hours, the culture medium
was removed and 1.5 ml of fresh EX-CELL 301 culture medium (JRH
Bioscience) was added to each well. The culture medium was
recovered after 48 hours and used as a sample for the measurement
of OCIF activity. The measurement of OCIF activity was carried
out according to the method described by Kumegawa, M. et al.
(Protein, Nucleic Acid, and Enzyme, Vol. 34, p 999 (1989)) and
the method of Takahashi, N. et al. (Endocrinology vol. 122, p
1373 (1988)). The osteoclast formation in the presence of
activated vitamin D3 from bone marrow cells isolated from mice
aged about 17 days was evaluated by the induction of
tartrate-resistant acid phosphatase activity. The inhibition of
the acid phosphatase was measured and used as the activity of
the protein which possesses osteoclastogenesis-inhibitory
activity (OCIF). Namely, 100 µl/well of an OCIF sample which
was diluted with α-MEM culture medium (Gibco BRL) containing
2 x 10^-8 M activated vitamin D3 and 10% fetal bovine serum was added
to each well of a 96-well microplate. Then, 3 x 10^5 bone marrow
cells isolated from mice (about 17 days old) suspended in 100
µl of α-MEM culture medium containing 10% fetal bovine serum
were added to each well of the 96-well microplate and cultured
for a week at 37°C and 100% humidity under a 5% CO2 atmosphere.
On days 3 and 5, 160 µl of the conditioned medium was removed
from each well, and 160 µl of a sample which was diluted with
α-MEM culture medium containing 1 x 10^-8 M activated vitamin D3
and 10% fetal bovine serum was added. Seven days after the start
of culturing, the cells were washed with phosphate buffered
saline and fixed with an ethanol/acetone (1:1) solution for one
minute at room temperature. The osteoclast formation was
detected by staining the cells using an acid phosphatase
activity measurement kit (Acid Phosphatase, Leucocyte, Cat. No.
387-A, Sigma Company). A decrease in the number of cells
positive for acid phosphatase activity in the presence of
tartaric acid was taken as the OCIF activity. The results are
shown in Table 1, which indicates that the conditioned medium
exhibits activity similar to that of natural type OCIF obtained from
the IMR-90 culture medium and recombinant OCIF produced by CHO
cells.
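
The relationship between the two vitamin D3 concentrations used in the assay above can be illustrated with a short dilution calculation. This is a sketch only: it assumes that the cell suspension added to each well contains no activated vitamin D3 (the text does not mention any), and the function name `final_concentration` is hypothetical.

```python
def final_concentration(stock_molar, stock_volume_ul, added_volume_ul):
    """Concentration after mixing a stock with a volume lacking the solute."""
    total_volume = stock_volume_ul + added_volume_ul
    if total_volume <= 0:
        raise ValueError("total volume must be positive")
    return stock_molar * stock_volume_ul / total_volume


# 100 µl of sample medium at 2 x 10^-8 M plus 100 µl of cell suspension
# (assumed to contain no activated vitamin D3).
c = final_concentration(2e-8, 100, 100)
print(f"{c:.1e} M")  # 1.0e-08 M
```

Under that assumption the starting concentration in each well matches the 1 x 10^-8 M medium used for the exchanges on days 3 and 5.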
TABLE 1

Activity of OCIF expressed by COS-7 cells in the conditioned
medium

Dilution                      1/10  1/20  1/40  1/80  1/160  1/320

OCIF genomic DNA introduced   ++    ++    ++    ++    +
Vector introduced
Untreated

"++" indicates an activity inhibiting 80% or more of
osteoclast formation, "+" indicates an activity inhibiting
30-80% of osteoclast formation, and "-" indicates that no
inhibition of osteoclast formation is observed.
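
The scoring scheme in the legend above can be sketched as a small function. This is an illustration under stated assumptions: the legend does not say how inhibition between 0% and 30% was scored, so the sketch groups such values with "-", and the cell counts and the names `percent_inhibition` and `score_inhibition` are hypothetical.

```python
def percent_inhibition(positive_cells_sample, positive_cells_control):
    """Percent decrease in phosphatase-positive cells relative to a control."""
    if positive_cells_control <= 0:
        raise ValueError("control count must be positive")
    decrease = positive_cells_control - positive_cells_sample
    return 100.0 * decrease / positive_cells_control


def score_inhibition(percent):
    """Map percent inhibition of osteoclast formation to a Table 1 symbol."""
    if percent >= 80:
        return "++"
    if percent >= 30:
        return "+"
    # Assumption: values below 30% are reported as "-" in this sketch.
    return "-"


# Hypothetical counts: 20 positive cells with sample, 200 in the control.
print(score_inhibition(percent_inhibition(20, 200)))  # ++
```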

(iii) Identification of the product by Western Blotting

A buffer solution (10 µl) for SDS-PAGE (0.5 M Tris-HCl,
20% glycerol, 4% SDS, 20 µg/ml bromophenol blue, pH 6.8) was
added to 10 µl of the sample for the measurement of OCIF activity
prepared in (ii) above. After boiling for 3 minutes at 100°C,
the mixture was subjected to 10% SDS polyacrylamide gel
electrophoresis under non-reducing conditions. The proteins
were transferred from the gel to a PVDF membrane (ProBlott,
Perkin Elmer) using a semi-dry blotting apparatus (Biorad). The
membrane was blocked and incubated for 2 hours at 37°C together
with a horseradish peroxidase-labeled anti-OCIF antibody
obtained by labeling the previously obtained OCIF protein with
horseradish peroxidase according to a conventional method.
After washing, the protein which had bound the anti-OCIF
antibody was detected using the ECL system (Amersham). As shown
in Figure 1, two bands, one with a molecular weight of about
120 kilodaltons and the other about 60 kilodaltons, were detected in
the supernatant obtained from the culture broth of COS-7 cells
into which pWESRαOCIF was transfected. On the other hand, these
two bands with molecular weights of about 120 kilodaltons and
60 kilodaltons were not detected in the supernatant obtained
from the culture broth of COS-7 cells into which the pWESRα vector
was transfected, confirming that the protein obtained was OCIF.

INDUSTRIAL APPLICABILITY 

The present invention provides a genomic DNA encoding 
a protein OCIF which possesses an osteoclastogenesis- 
inhibitory activity and a process for preparing this protein 
by a genetic engineering technique using the genomic DNA. The 
protein obtained by expressing the gene of the present invention 
exhibits an osteoclastogenesis-inhibitory activity and is 
useful as an agent for the treatment and improvement of diseases 
involving a decrease in the amount of bone such as osteoporosis, 
other diseases resulting from bone metabolism abnormality such 
as rheumatism or degenerative joint disease, and multiple 
myeloma. The protein is further useful as an antigen to 
establish antibodies useful for an immunological diagnosis of 
such diseases. 

NOTE ON MICROORGANISM 
Depositing Organization: 

The Ministry of International Trade and Industry, National 
Institute of Bioscience and Human Technology, Agency of 
Industrial Science and Technology 

Address: 1-3, Higashi 1-chome, Tsukuba-shi, Ibaraki-ken,
Japan

Date of Deposition: June 21, 1995 (originally deposited on 
June 21, 1995 and transferred to the international 
deposition according to the Budapest Treaty on 
October 25, 1995) 

Accession No. FERM BP-5267

TABLE OF SEQUENCES

Sequence number: 1
Length of sequence: 1316
Sequence Type: nucleic acid
Strandedness: double
Topology: linear
Molecular type: genomic DNA (human OCIF genomic DNA-1)
Sequence:

CTGGAGACAT ATAACTTGAA CACTTGGCCC TGATGGGGAA GCAGCTCTGC AGGGACTTTT 60
TCACCCATCT GTAAACAATT TCAGTGGCAA CCCGCGAACT GTAATCCATG AATGGGACCA 120
CACTTTACAA GTCATCAAGT CTAAGTTCTA GACCAGGGAA TTAATGGGGG AGACAGCGAA 180
CCCTAGAGCA AAGTGCCAAA CTTyTGTCGA TAGCTTGAGG CTAGTGGAAA GACCTCGAGG 240
AGGCTACTCC AGAAGTTCAG CGyGTAGGAA GCTCCGATAC CAATAGCCCT TTGATGATGG 300
TGGGGTTGGT GAAGGGAACA GTGCTCCGCA AGGTTATCCC TGCCCCAGGC AGTCCAATTT 360
TCACTCTGCA GATTCTCTCT GGCTCTAACT ACCCCAGATA ACAAGGAGTG AATGCAGAAT 420
AGCACGGGCT TTAGGGCCAA CAGACATTA GTTAGAAAAA TTCCTACTAC ATGGTTTATG 480
TAAACTTGAA GATGAATGAT TGCGAACTCC CCGAAAAGGG CTCAGACAAT GCCATGCATA 540
AAGAGGGGCC CTGTAA AGGTTTCAGA ACCCGAAGTG AAGGGGTCAG GCAGCCGGGT 600
ACGGCGGAAA CTCACAG TCGCCCAGCG AGAGGACAAA GGTCTGGGAC ACACTCCAAC 660
TGCGTCCGGA TCTOGCTGG ATCGGACTCT CAGGGTGGAG GAGACACAAG CACAGCAGCT 720
GCCCAGCGTG TGCCCAGCCC TCCCACCGCT GGTCCCGGCT GCCAGGAGGC TGGCCGCTGG 780
CGGGAAGGGG CCGGGAAACC TCAGAGCCCC GCGGAGACAG CAGCCGCCn GTTCCTCAGC 840
CCGGTGGCTT TTTTT/CCCC TGCTCTCCCA GGGGACAGAC ACCACCGCCC CACCCCTCAC 900
GCCCCACCTC CCTGGGGGAT CCTITCCGCC CCAGCCCTGA AAGCGTTAAT CCTGGAGCTT 960
TCTGCACACC CCCCGACCGC TCCCGCCCAA GCTTCCTAAA AAAGAAAGGT GCAAAGTTTG 1020
GTCCAGGATA GAAAAATCAC TGATCAAAGG CAGGCGATAC TTCCTGTTGC CGGGACGCTA 1080
TATATAACGT GATGAGCGCA CGGGCTGCGG AGACGCACCG GAGCGCTCGC CCAGCCGCCG 1140
CCTCCAACCC CCTGAGGTTT CCGGGGACCA CA ATG AAC AAG TTG CTG TGC TGC 1193
                                    Met Asn Lys Leu Leu Cys Cys
GCG CTC GTG GTAAGTCCCT GGGCCAGCCG ACGGGTGGCC GGCGCCTGGG
Ala Leu Val
GAGGCTGCTG CCACCTGGTC TCCCAACCTC CCAGCGGACC GGCGGGGAGA AGGCTCCACT 1302
CGCTCCCTCC CAGG 1316


Sequence number: 2
Length of sequence: 9898
Sequence Type: nucleic acid
Strandedness: double
Topology: linear
Molecular type: genomic DNA (human OCIF genomic DNA-2)

Sequence:

GCTTACTTTG TGCCAAATCT CATTAGGCTT AAGGTAATAC AGGACTTTGA GTCAAATGAT 60 

ACrGTTGCAC ATAAGAACAA AGCTATTTTC ATGCTAAGAT GATGCCACTG TGTTCCTTTC 120 


TCCTTCTAG TTT CTG GAC ATC TCC ATT AAG TGG ACC ACC CAG GAA ACG TTT 171
          Phe Leu Asp Ile Ser Ile Lys Trp Thr Thr Gln Glu Thr Phe

CCT CCA AAG TAC CTT CAT TAT GAC GAA GAA ACC TCT CAT CAG CTG TTG 219
Pro Pro Lys Tyr Leu His Tyr Asp Glu Glu Thr Ser His Gln Leu Leu

TGT GAC AAA TGT CCT CCT GGT ACC TAC CTA AAA CAA CAC TGT ACA GCA 267
Cys Asp Lys Cys Pro Pro Gly Thr Tyr Leu Lys Gln His Cys Thr Ala

AAG TGG AAG ACC GTG TGC GCC CCT TGC CCT GAC CAC TAC TAC ACA GAC 315
Lys Trp Lys Thr Val Cys Ala Pro Cys Pro Asp His Tyr Tyr Thr Asp

AGC TGG CAC ACC AGT GAC GAG TGT CTA TAC TGC AGC CCC GTG TGC AAG 363
Ser Trp His Thr Ser Asp Glu Cys Leu Tyr Cys Ser Pro Val Cys Lys

GAG CTG CAG TAC GTC AAG CAG GAG TGC AAT CGC ACC CAC AAC CGC GTG 411
Glu Leu Gln Tyr Val Lys Gln Glu Cys Asn Arg Thr His Asn Arg Val

TGC GAA TGC AAG GAA GGG CGC TAC CTT GAG ATA GAG TTC TGC TTG AAA 459
Cys Glu Cys Lys Glu Gly Arg Tyr Leu Glu Ile Glu Phe Cys Leu Lys

CAT AGG AGC TGC CCT CCT GGA TTT GGA GTG GTG CAA GCT G GTACGTGTCA 509
His Arg Ser Cys Pro Pro Gly Phe Gly Val Val Gln Ala


ATGTGCAGCA AAATTAATTA GGATCATGCA AAGTCAGATA GTTGTGACAG TTTAGGAGAA 569 
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CACTTTTGTT CTGATGACAT TATAGGATAG CAAATTGCAA AGGTAATGAA ACCTGCCAGG 629 
TAGGTACTAT GTGTCTGGAG TGCTTCCAAA GGACCATTGC TCAGAGGAAT ACTTTGCCAC 689
TACAGGGCAA TTTAATGACA AATCTCAAAy GCAGCAAATT ATTCTCTCAT CAGATGCATG 749 
ATGGTTTTTT TTTTTTTTTT TAAAGAAAOA AACTCAAGTT GCACTATTGA TAGTTGATCT 809 
ATACCTCTAT ATTTCACTTC AGCATGGACA CCTTCAAACT GCAGCACTTT 17GACAAACA 869 
TCAGAAATGT TAATTTATAC CAAGAGAGTA ATTATGCTCA TATTAATGAG ACTCTGGAGT 929
GCTAACAATA AGCAGTTATA ATTAATTATG TAAAAAATGA GAATGGTGAG GGGAATTGCA 989
TTTCATTATT AAAAACAAGG CTAGTTCTTC CTTTAGCATG GGAGCTGAGT GTTTGGGAGG 1049
GTAAGGACTA TAGCAGAATC TCTTCAATGA GCTTATTCTT TATCTTAGAC AAAACAGATT 1109 
GTCAAGCCAA GAGCAAGCAC TTQCCTATAA ACCAAGTGCT TTCTCTTTTG CATTTTGAAC 1169 
AGCATTGGTC AGGGCTCATG TOTATTGAAT CTTTTAAACC AGTAACCCAC GTTTTTTTTC 1229 
TGCCACATTT GCGAAGCTTC AGTGCAGCCT ATAACTTTTC ATAGCTTGAG AAAATTAAGA 1289 
GTATCCACTT ACTTAGATGG MAGAAGTAAT CAGTATAGAT TCTGATGACT CAGTTTGAAG 1349 
CAGTGTTTCT CAACTGAAGC CCTGCTGATA TTTTAAGAAA TATCTGGATT CCTAGGCTGG 1409
ACTCCTTTTT GTGGGCAGOT GTCCTGCGCA TTGTAGAATT TTGGCAGCAC CCCTGGACTC 1469 
TAGCCACTAG ATACCAATAG CAGTCCTTCC CCCATGTGAC AGCCAAAAAT GTCTTCAGAC 1529 
ACTGTCAAAT GTCGCCAGGT GGCAAAATCA CTCCTGGTTG AGAACAGGGT CATCAATGCT 1589 
AAGTATCTGT AACTATOTTA ACTCTCAAAA CTTGTGATAT ACAAAGTCTA AATTATTAGA 1649 
CGACCAATAC mAGGTTTA AAGGCATACA AATGAAACAT TCAAAAATCA AAATCTATTC 1709 
TGTTTCTCAA ATA0TGAATC TTATAAAATT AATCACAGAA GATGCAAATT GCATCAGAGT 1769 
CCCTTAAAAT TCCTCTTCGT ATGAGTATTT GAGGGAGGAA TTGGTGATAG TTCCTACTTT 1829
CTATTGGATG GTACTTTGAG ACTCAAAAGC TAAGCTAAGT TGTGTGTGTG TCAGGGTGCG 1889 
GGGTGTGGAA TCCCATCAGA TAAAAGCAAA TCCATGTAAT TCATTCAGTA AGHGTATAT 1949 
GTAGAAAAAT GAAAAGTGGG CTATGCAGCT TGGAAACTAG AGAATTTTGA AAAATAATGG 2009 
AAATCACAAG GATCTTTCTT AAATAAGTAA GAAAATCTGT TTGTAGAATG AAGCAAGCAG 2069
GCAGCCAGAA GACTCAGAAC AAAAGTACAC ATTTTACTCT GTGTACACTG GCAGCACAGT 2129

GGGATTTATT TACCTCTCCC TCCCTAAAAA CCCACACAGC GGTTCCTCTT GGGAAATAAG 2189 
AGGTTTCCAG CCCAAAGAGA AGGAAAGACT ATGTGGTGTT ACTCTAAAAA GTATTTAATA 2249
ACCGTTTTGT TGTTGCTGTT GCTGTTTTGA AATCAGATTG TCTCCTCTCC ATATTTTATT 2309 
TACTTCATTC TGTTAATTCC TGTGGAATTA CTTAGAGCAA GCATGGTGAA TTCTCAACTG 2369
TAAAGCCAAA TTTCTCCATC ATTATAATTT CACAmTGC CTGGCAGGTT ATAATTTTTA 2429 
TATTTCCACT GATAGTAATA AGGTAAAATC ATTACTTAGA TGGATAGATC TTTTTCATAA 2489 
AAAGTACCAT CAGTTATAGA GGGAAGTCAT GrrCATGTTC AGGAAGGTCA TTAGATAAAG 2549 
CTTCTGAATA TATTATGAAA CATTAGTTCT GrCATTCTTA GATTCTTTTT GTTAAATAAC 2609 
TTTAAAAGCT AACTTACCTA AAAGAAATAt/oTGACACATA TGAACTTCTC ATTAGGATGC 2669 
AGGAGAACAC CCAAGCCACA GATATGTAJC TGAAGAATGA ACAAGATTCT TAGGCCCGGC 2729 
ACGGTGGCTC ACATCTGTAA TCTCAAg/gT TTGAGAGGTC AAGGCGGGCA GATCACCTGA 2789 
GGTCAGGAGT TCAAGACCAG CCTGGOCAAC ATGATGAAAC CCTGCCTCTA CTAAAAATAC 2849 
AAAAATTAGC AGGGCATGGT GGTg/aTGCC TGCAACCCTA GCTACTCAGG AGGCTGAGAC 2909 
AGGAGAATCT CTTGAACCCT CGAGGCGGAG GTTGTGGTGA GCTGAGATCC CTCTACTGCA 2969 
CTCCAGCCTG GGTGACAGAG ATCAGACTCC GTCCCTGCCG CCGCCCCCGC CTTCCCCCCC 3029 
AAAAAGATTC TTCTTCATGC /GAACATACG GCAGTCAACA AAGGGAGACC TGGGTCCAGG 3089 
TGTCCAAGTC ACTTATTTCy AGTAAATTAG CAATGAAAGA ATGCCATGGA ATCCCTGCCC 3149 
AAATACCTCT GCTTATGa/a TTGTAGAATT TGATATAGAG TTGTATCCCA TTTAAGGAGT 3209 
AGGATGTAGT AGGAAAGTAC TAAAAACAAA CACACAAACA GAAAACCCTC TTTGCTTTGT 3269 
AAGGTGGTTC CTAAg/taaT GTCAGTGCAA TGCTGGAAAT AATATTTAAT ATGTGAAGGT 3329 
TTTAGGCTGT GTTDTCCCCT CCTGTTCTTT TTTTCTGCCA GCCCTTTGTC ATTTTTGCAG 3389 
GTCAATGAAT CATGTAGAAA GAGACAGGAG ATGAAACTAG AACCAGTCCA TTTTGCCCCT 3449 
TTTTTTATTT TJ6TGGTTTTG GTAAAAGATA CAATGAGGTA GGAGGTTGAG ATTTATAAAT 3509 
GAAGTTTAAT MGTTTCTGT AGCTTTGATT TTTCTCTTTC ATATTTGTTA TCTTGCATAA 3569 
GCCAGAATW GCCTGTAAAA TCTACATATG GATATTGAAG TCTAAATCTG TTCAACTAGC 3629 
TTACACTAGA TGGAGATATT TTCATATTCA GATACACTGG AATGTATGAT CTAGCCATGC 3689 





GTAATATAGT CAAGTGTTTG AAGGTATTTA TTTTTAATAG CGTCTTTAGT TGTGGACTGG 3749
TTCAAGTTTT TCTGCCAATG ATTTCTTCAA ATTTATCAAA 7ATTTTTCCA TCATGAAGTA 3809 
AAATGCCCTT GCAGTCACCC TTCCTGAAGT TTGAACGACy CTGCTGTTTT AAACAGTTTA 3869 
AGCAAATGGT ATATCATCTT CCGTTTACTA TGTAGCTTAA CTGCAGGCTT ACGCTTTTGA 3929 
GTCAGCGGCC AACTTTATTG CCACCTTCAA AAGTTTATTA TAATGTTGTA AATTTTTACT 3989 
TCTCAAGGTT AGCATACTTA GGAGTTGCTT CACAATTAGG ATTCAGGAAA GAAAGAACTT 4049 
CAGTAGGAAC TGATTGGAAT TTAATGATGC AGCAITCAAT GGGTACTAAT TTCAAAGAAT 4109 
GATATTACAG CAGACACACA GCAGTTATCT TGAnTTCTA GGAATAATTG TATGAAGAAT 4169 
ATGGCTGACA ACACGGCCTT ACTGCCACTC AGCGGAGGCT GGACTAATGA ACACCCTACC 4229 
CTTCTTTCCT TTCCTCTCAC ATTTCATGAG OGTTTTGTAG GTAACGAGAA AATTGACTTG 4289 
CATTTGCATT ACAAGGAGGA GAAACTGGCA MAGGGGATGA TGGTGGAAGT TTTGTTCTGT 4349 
CTAATGAAGT GAAAAATGAA AATGCTAGAT TTTTGTGCAA CATAATAGTA GCAGTAAAAA 4409
CCAAGTGAAA AGTCTTTCCA AAACTGTGTT AAGAGGGCAT CTGCTGGGAA ACGATTTGAG 4469

GAGAAGGTAC TAAATTGCTT GGTATTTTCC GTA t GA ACC CCA GAG CGA AAT ACA 4523
                                      Gly Thr Pro Glu Arg Asn Thr
                                              115


GTT TGC AAA AGA TGT CCA GAT GGG TTC TTC TCA AAT GAG ACG TCA TCT 4571
Val Cys Lys Arg Cys Pro Asp Gly Phe Phe Ser Asn Glu Thr Ser Ser
120                 125                 130                 135


AAA GCA CCC TGT AGA AAA CAC ACA AAT TGC AGT GTC TTT GGT CTC CTG 4619
Lys Ala Pro Cys Arg Lys His Thr Asn Cys Ser Val Phe Gly Leu Leu
                140                 145                 150



CTA ACT CAG AAA GGA AAT GCA ACA CAC GAC AAC ATA TGT TCC GGA AAC 4667
Leu Thr Gln Lys Gly Asn Ala Thr His Asp Asn Ile Cys Ser Gly Asn
            155                 160                 165

AGT GAA TCA ACT CAA AAA TGT GGA ATA G GJAATTACAT TCCAAAATAC 4715
Ser Glu Ser Thr Gln Lys Cys Gly Ile
        170                 175


GTCTTTGTAC GATTTTGTAG TATCATCTCT OfCTCTGAGT TGAACACAAG GCCTCCACCC 4775 
ACATTCTTGG TCAAACTTAC ATTTTCCCTT TCTTGAATCT TAACCAGCTA AGGCTACTCT 4835
CGATGCATTA CTGCTAAAGC TACCACTCAtf AATCTCTCAA AAACTCATCT TCTCACAGAT 4895 
AACACCTCAA AGCTTGATTT TCTCTCCTTT CACACTGAAA TCAAATCTTG CCCATAGGCA 4955 
AAGGGCAGTG TCAAGTTTGC CACTGAGATG AAATTAGGAG AGTCCAAACT GTAGAATTCA 5015 
CGTTGTGTGT TATTACTTTC ACGAA7GTCT GTATTATTAA CTAAAGTATA TATTGGCAAC 5075 
TAAGAAGCAA AGTGATATAA ACATOATCAC AAATTAGGCC AGGCATGGTG GCTTACTCCT 5135 
ATAATCCCAA CATTTTGGGG GGC(MAGGTA GGCAGATCAC TTGAGGTCAG GATTTCAAGA 5195 
CCAGCCTGAC CAACATGGTG AAACCTTGTC TCTACTAAAA ATACAAAAAT TAGCTGGGCA 5255 
TGGTAGCAGG CACTTCTAGT ACCAGCTACT CAGGGCTGAG GCAGGAGAAT CGCTTGAACC 5315 
CAGGAGATGG AGGTTGCAGT GAGCTGAGAT TGTACCACTG CACTCCAGTC TGGGCAACAG 5375
AGCAAGATTT CATCACACA0 ACACACACAC ACACACACAC ACACATTAGA AATGTGTACT 5435 
TGGCTTTGTT ACCTATGGTA TTAGTGCATC TATTGCATGG AACTTCCAAG CTACTCTGGT 5495
TGTGTTAAGC TCTTCAIftGG GTACAGGTCA CTAGTATTAA GTTCAGGTTA TTCGGATGCA 5555 
TTCCACGGTA GTGATQACAA TTCATCAGGC TAGTGTGTGT GTTCACCTTG TCACTCCCAC 5615 
CACTAGACTA ATCT0AGACC TTCACTCAAA GACACATTAC ACTAAAGATG ATTTGCTTTT 5675 
TTGTGTTTAA TCAAGCAATG GTATAAACCA GCTTGACTCT CCCCAAACAG TTTTTCGTAC 5735 
TACAAAGAAG TTTATGAAGC AGAGAAATGT GAATTGATAT ATATATGAGA TTCTAACCCA 5795 
GTTCCAGCAT TGTTTCATTG TGTAATTGAA ATCATAGACA AGCCATTTTA GCCTTTGCTT 5855 
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TCTTATCTAA AAAAAAAAAA AAAAAAATGA AGGAAGGGGyATTAAAAGGA GTGATCAAAT 5915 
TTTAACATTC TCTTTAATTA ATTCATTTTT AATTTTACTT TTTTTCATTT ATTGTGCACT 5975 
TACTATGTGG TACTGTGCTA TAGAGGCTTT AACATTTATA AAAACACTGT GAAAGTTGCT 6035 
TCAGATGAAr ATAGGTAGTA GAACGGCAGA ACTAGTATTC AAAGCCAGGT CTGATGAATC 6095 
CAAAAACAAA CACCCATTAC TCCCATTTTC TGGJMCATAC TTACTCTACC CAGATGCTCT 6155 
GGGCTTTGTA ATGCCTATGT AAATAACATA GTTTTATGTT TGGTTATTTT CCTATGTAAT 6215 
GTCTACTTAT ATATCTGTAT CTATCTCTTG CTTTGTTTCC AAAGGTAAAC TATGTGTCTA 6275
AATGTGGGCA AAAAATAACA CACTATTCc/ AATTACTGTT CAAATTCCTT TAAGTCAGTG 6335 
ATAATTATTT GTTTTGACAT TAATCATGAA CTTCCCTGTG GGTACTAGGT AAACCTTTAA 6395 
TAGAATGTTA ATGTTTGTAT TCATTATAAG AATTTTTGGC TGTTACTTAT TTACAACAAT 6455
ATTTCACTCT AATTAGACAT TTACXAAACT TTCTCTTGAA AACAATGCCC AAAAAAGAAC 6515 
ATTAGAAGAC ACGTAAGCTC AGHGCTCTC TGCCACTAAG ACCAGCCAAC AGAAGCTTGA 6575 
TTTTATTCAA ACTTTGCATT TTAGCATATT TTATCTTGGA AAATTCAATT GTGTTGGTTT 6635 
TTTGTTTTTG TTTGTATTGA ATAGACTCTC AGAAATCCAA TTGTTGAGTA AATCTTCTGG 6695 
GTTTTCTAAC CTTTCTTTAG AT GTT ACC CTG TGT GAG GAG GCA TTC TTC AGG 6747
                     Asp Val Thr Leu Cys Glu Glu Ala Phe Phe Arg

TTT GCT GTT CCT ACA AAG TTT ACG CCT AAC TGG CTT AGT GTC TTG GTA 6795
Phe Ala Val Pro Thr Lys Phe Thr Pro Asn Trp Leu Ser Val Leu Val

GAC AAT TTG CCT GGC ACC AAA GTA AAC GCA GAG AGT GTA GAG AGG ATA 6843
Asp Asn Leu Pro Gly Thr Lys Val Asn Ala Glu Ser Val Glu Arg Ile

AAA CGG CAA CAC AGC TCA CAA GAA CAG ACT TTC CAG CTG CTG AAG TTA 6891
Lys Arg Gln His Ser Ser Gln Glu Gln Thr Phe Gln Leu Leu Lys Leu
220                 225                 230                 235

TGG AAA CAT CAA AAC AAA GAC CAA GAT ATA GTC AAG AAG ATC ATC CAA G 6940
Trp Lys His Gln Asn Lys Asp Gln Asp Ile Val Lys Lys Ile Ile Gln
                240                 245                 250

GTATGATAAT CTAAAATAAA AAGATCAATC AGAAATCAAA GACACCTATT TATCATAAAC 7000
CAGGAACAAG ACTGCATGTA TGTTTAGTTG /gTGGATCTT GTTTCCCTGT TGGAATCATT 7060 
GTTGGACTGA AAAAGTTTCC ACCTGATAAy GTAGATGTGA TTCCACAAAC AGTTATACAA 7120 
GGTTTTGTTC TCACCCCTGC TCCCCAGTTT CCTTGTAAAG TATGTTGAAC ACTCTAAGAG 7180 
AAGAGAAATG CATTTGAAGG CAGGGCTGTA TCTCAGGGAG TCGCTTCCAG ATCCCTTAAC 7240
GCTTCTGTAA GCAGCCCCTC TAGACc/cCA AGGAGAAGCT CTATAACCAC TTTGTATCTT 7300 
ACATTGCACC TCTACCAAGA AGCTCffGTTG TATTTACTTG GTAATTCTCT CCAGGTAGGC 7360 
TTTTCGTAGC TTACAAATAT GTTOTTATTA ATCCTCATGA TATGGCCTGC ATTAAAATTA 7420 
TTTTAATGGC ATATGTTATG AGAATTAATG AGATAAAATC TGAAAAGTGT TTGAGCCTCT 7480 
TGTAGGAAAA AGCTAGTTAC AjSCAAAATGT TCTCACATCT TATAAGTTTA TATAAAGATT 7540 
CTCCTTTAGA AATGGTGTGA GAGAGAAACA GAGAGAGATA GGGAGAGAAG TGTGAAAGAA 7600
TCTGAAGAAA AGGAGTTTCa/tCCAGTGTGG ACTGTAAGCT TTACGACACA TGATGGAAAG 7660 
AGTTCTGACT TCAGTAAGOA TTGGGAGGAC ATGCTAGAAG AAAAAGGAAG AAGAGTTTCC 7720 
ATAATGCAGA CAGGGTCAGT GAGAAATTCA TTCAGGTCCT CACCAGTAGT TAAATGACTG 7780 
TATAGTCTTG CACTACj/cTA AAAAACTTCA AGTATCTGAA ACCCGGGCAA CAGATTTTAG 7840 
GAGACCAACG TCTTTGAGAG CTGATTGCTT TTGCTTATGC AAAGAGTAAA CTTTTATGTT 7900 
TTGAGCAAAC CAAAAGTATT CTTTGAACGT ATAATTAGCC CTGAAGCCGA AAGAAAAGAG 7960 
AAAATCAGAG ACCOTTAGAA TTGGAAGCAA CCAAATTCCC TATTTTATAA ATGAGGACAT 8020


TTTAACCCAG AAAGATGAAC CGATTTGGCT TAGGGCTCAC A0ATACTAAG TGACTCATGT 8080 
CATTAATAGA AATGTTAGTT CCTCCCTCTT AGGTTTGTAC CCTAGCTTAT TACTGAAATA 8140
TTCTCTAGGC TGTGTGTCTC CTTTAGTTCC TCGACCTCAT GTCTTTGAGT TTTCAGATAT 8200 
CCTCCTCATG GAGGTAGTCC TCTGGTGCTA TGTGTApTCT TTAAAGGCTA GTTACGGCAA 8260 
TTAACTTATC AACTAGCGCC TACTAATGAA ACTTpGTATT ACAAAGTAGC TAACTTGAAT 8320 
ACTTTCCTTT TTTTCTGAAA TGTTATGGTG GTMTTTCTC AAACTTTTTC TTAGAAAACT 8380 
GAGAGTGATG TGTCTTATTT TCTACTGTTA A7TTTCAAAA TTAGGAGCTT CTTCCAAAGT 8440 
TTTGTTGGAT GCCAAAAATA TATAGCATAT /tATCTTATTA TAACAAAAAA TATTTATCTC 8500 
AGTTCTTAGA AATAAATGGT GTCACTTAA0 TCCCTCTCAA AAGAAAAGGT TATCATTGAA 8560 
ATATAATTAT GAAATTCTGC AAGAACCXTT TGCCTCACGC TTGTTTTATG ATGGCATTGG 8620 
ATGAATATAA ATGATGTGAA CACTTA^CTG GGCTTTTGCT TTATGCAG AT ATT GAC 8676 

                                                    Asp Ile Asp

CTC TGT GAA AAC AGC GTG CAG CGG CAC ATT GGA CAT GCT AAC CTC ACC 8724
Leu Cys Glu Asn Ser Val Gln Arg His Ile Gly His Ala Asn Leu Thr
255                 260                 265                 270

TTC GAG CAG CTT CGT AGC TTG ATG GAA AGC TTA CCG GGA AAG AAA GTG 8772
Phe Glu Gln Leu Arg Ser Leu Met Glu Ser Leu Pro Gly Lys Lys Val
                275                 280                 285

GGA GCA GAA GAC ATT GAA AAA ACA ATA AAG GCA TGC AAA CCC AGT GAC 8820
Gly Ala Glu Asp Ile Glu Lys Thr Ile Lys Ala Cys Lys Pro Ser Asp
            290                 295                 300

CAG ATC CTG AAG CTG CTC AGT TTG TGG CGA ATA AAA AAT GGC GAC CAA 8868
Gln Ile Leu Lys Leu Leu Ser Leu Trp Arg Ile Lys Asn Gly Asp Gln
        305                 310                 315

GAC ACC TTG AAG GGC CTA ATG CAC GCA CTA AAG CAC TCA AAG ACG TAC 8916
Asp Thr Leu Lys Gly Leu Met His Ala Leu Lys His Ser Lys Thr Tyr
    320                 325                 330

CAC TTT CCC AAA ACT GTC ACT CAG AGT CTA AAG AAG ACC ATC AGG TTC 8964
His Phe Pro Lys Thr Val Thr Gln Ser Leu Lys Lys Thr Ile Arg Phe
335                 340                 345                 350

CTT CAC AGC TTC ACA ATG TAC AAA TTG TAT CAG AAG TTA TTT TTA GAA 9012
Leu His Ser Phe Thr Met Tyr Lys Leu Tyr Gln Lys Leu Phe Leu Glu
                355                 360                 365

ATG ATA GGT AAC CAG GTC CAA TCA GTA AAA ATA AGC TGC TTA 9054
Met Ile Gly Asn Gln Val Gln Ser Val Lys Ile Ser Cys Leu
            370                 375                 380

TAACTGGAAA TGGCCATTGA GCTGTTTCCT CACAATTGGC GAGATCCCAT GGATGAGTAA 9114 
ACTGTTTCTC AGGCACTTGA GGCTTTCAGT GATATCTTTC TCATTACCAG TGACTAATTT 9174 
TGCCACAGGG TAC1AAAAGA AACTATGATG TGGAGAAAGG ACTAACATCT CCTCCAATAA 9234 
ACCCCAAATG Gt/aATCCAA CTGTCAGATC TGGATCGTTA TCTACTGACT ATATTTTCCC 9294 
TTATTACTGC TTGCAGTAAT TCAACTGGAA ATTAAAAAAA AAAAACTAGA CTCCACTGGG 9354 
CCTTACTAAA TATGGGAATG TCTAACTTAA ATAGCTTTGG GATTCCAGCT ATGCTAGAGG 9414
CTTTTATTAT AAAGCCATAT TTTTTTCTGT AAAAGTTACT AATATATCTG TAACACTATT 9474
ACAGTATTGC TATTTATATT CATTCAGATA TAAGATTTGG A0ATATTATC ATCCTATAAA 9534
GAAACGGTAT GACTTAATTT TAGAAAGAAA ATTATATTCT GTTTATTATG ACAAATGAAA 9594
GAGAAAATAT ATATTTTTAA TGGAAAGTTT GTAGCATTPT TCTAATAGGT ACTGCCATAT 9654
TTTTCTGTGT GCAGTATTTT TATAATTTTA tctgtaxaag CTGTAATATC ATTTTATAGA 9714
AAATGCATTA TTTAGTCAAT TGTTTAATGT TGGAAMCAT ATGAAATATA AATTATCTGA 9774
ATATTAGATG CTCTGAGAAA TTGAATGTAC TAAA AGATTTTATff GTTTTATAAC 9834
TATATAAATG ACATTATTAA AGTTTTCAAA TTTTTTA TTGCTTTCrC TGTTGCTTTT 9894
ATTT 9898


Sequence number: 3
Length of sequence: 401
Sequence Type: amino acid
Strandedness: single stranded
Topology: linear
Molecular type: protein

Sequence:

Met Asn Asn Leu Leu Cys Cys Ala Leu Val Phe Leu Asp Ile Ser
    -20                 -15                 -10

Ile Lys Trp Thr Thr Gln Glu Thr Phe Pro Pro Lys Tyr Leu His
    -5                  1               5

Tyr Asp Glu Glu Thr Ser His Gln Leu Leu Cys Asp Lys Cys Pro
10                  15                  20

Pro Gly Thr Tyr Leu Lys Gln His Cys Thr Ala Lys Trp Lys Thr
25                  30                  35

Val Cys Ala Pro Cys Pro Asp His Tyr Tyr Thr Asp Ser Trp His
40                  45                  50


Ile Gln Asp Ile Asp Leu Cys Glu Asn Ser Val Gln Arg His Ile

Gly His Ala Asn Leu Thr Phe Glu Gln Leu Arg Ser Leu Met Glu

Ser Leu Pro Gly Lys Lys Val Gly Ala Glu Asp Ile Glu Lys Thr

Ile Lys Ala Cys Lys Pro Ser Asp Gln Ile Leu Lys Leu Leu Ser

Leu Trp Arg Ile Lys Asn Gly Asp Gln Asp Thr Leu Lys Gly Leu

Met His Ala Leu Lys His Ser Lys Thr Tyr His Phe Pro Lys Thr

Val Thr Gln Ser Leu Lys Lys Thr Ile Arg Phe Leu His Ser Phe

Thr Met Tyr Lys Leu Tyr Gln Lys Leu Phe Leu Glu Met Ile Gly

Asn Gln Val Gln Ser Val Lys Ile Ser Cys Leu


Sequence number: 4
Length of sequence: 1206
Sequence Type: nucleic acid
Strandedness: single stranded
Topology: linear
Molecular type: cDNA

Sequence:

ATGAACAACT TGCTGTGCTG CGCGCTCGTG TTTCTGGACA TCTCCATTAA GTGGACCACC 60
CAGGAAACGT TTCCTCCAAA GTACCTTCAT TATGACGAAG AAACCTCTCA TCAGCTGTTG 120
TGTGACAAAT GTCCTCCTGG TACCTACCTA AAACAACACT GTACAGCAAA GTGGAAGACC 180
GTGTGCGCCC CTTGCCCTGA CCACTACTAC ACAGACAGCT GGCACACCAG TGACGAGTGT 240
CTATACTGCA GCCCCGTGTG CAAGGAGCTG CAGTACGTCA AGCAGGAGTG CAATCGCACC 300
CACAACCGCG TGTGCGAATG CAAGGAAGGG CGCTACCTTG AGATAGAGTT CTGCTTGAAA 360
CATAGGAGCT GCCCTCCTGG ATTTGGAGTG GTGCAAGCTG GAACCCCAGA GCGAAATACA 420
GTTTGCAAAA GATGTCCAGA TGGGTTCTTC TCAAATGAGA CGTCATCTAA AGCACCCTGT 480
AGAAAACACA CAAATTGCAG TGTCTTTGGT CTCCTGCTAA CTCAGAAAGG AAATGCAACA 540
CACGACAACA TATGTTCCGG AAACAGTGAA TCAACTCAAA AATGTGGAAT AGATGTTACC 600
CTGTGTGAGG AGGCATTCTT CAGGTTTGCT GTTCCTACAA AGTTTACGCC TAACTGGCTT 660
AGTGTCTTGG TAGACAATTT GCCTGGCACC AAAGTAAACG CAGAGAGTGT AGAGAGGATA 720
AAACGGCAAC ACAGCTCACA AGAACAGACT TTCCAGCTGC TGAAGTTATG GAAACATCAA 780
AACAAAGACC AAGATATAGT CAAGAAGATC ATCCAAGATA TTGACCTCTG TGAAAACAGC 840
GTGCAGCGGC ACATTGGACA TGCTAACCTC ACCTTCGAGC AGCTTCGTAG CTTGATGGAA 900
AGCTTACCGG GAAAGAAAGT GGGAGCAGAA GACATTGAAA AAACAATAAA GGCATGCAAA 960
CCCAGTGACC AGATCCTGAA GCTGCTCAGT TTGTGGCGAA TAAAAAATGG CGACCAAGAC 1020
ACCTTGAAGG GCCTAATGCA CGCACTAAAG CACTCAAAGA CGTACCACTT TCCCAAAACT 1080
GTCACTCAGA GTCTAAAGAA GACCATCAGG TTCCTTCACA GCTTCACAAT GTACAAATTG 1140
TATCAGAAGT TATTTTTAGA AATGATAGGT AACCAGGTCC AATCAGTAAA AATAAGCTGC 1200
TTATAA 1206
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